Prediction of Ligand Binding sites in RNA binding protein Pockets using support vector machines

نویسندگان

  • Rahul Singh
  • Tiratha Raj Singh
چکیده

RNA-binding proteins play a significant role in pattern regulation of gene expression during developmental phases. Therefore in order to facilitate our understanding of organism development there is a continuous need to develop an extensive a priori method for the prediction of RNA-binding protein pockets. We present here a SVM (Support Vector Machine) based approach for successful prediction of these pockets. The method employs two datasets: the protein sequences of the RNA binding protein pockets and the non-RNA binding protein pockets, both of which when combined to form the positive and negative datasets to be fed into the SVM model. Before feeding the data to the SVM, both the datasets were crossed with several steps of sorting, which refined the selection process of obtaining ranked features of these datasets. Analysis was applied on 3 different featured datasets viz FPOCKET, Zernike and shell features. The results suggest that the top 10 features of shell are very important and play a pivotal role in the classification and prediction of ligand binding sites in RNA binding proteins. An accuracy of 89.3% was achieved when evaluated. This study demonstrates that it is possible to predict ligand binding sites in RNA binding protein pockets using its sequence.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification of RNA-binding sites in artemin based on docking energy landscapes and molecular dynamics simulation

There are questions concerning the functions of artemin, an abundant stress protein found in Artemiaduring embryo development. It has been reported that artemin binds RNA at high temperatures in vitro, suggesting an RNA protective role. In this study, we investigated the possibility of the presence of RNA-bindingsites and their structural properties in artemin, using docking energy ...

متن کامل

Investigation the Mechanism of Interaction between Inhibitor ALISERTIB with Protein Kinase A and B Using Modeling, Docking and Molecular Dynamics Simulation

The high level of conservation in ATP-binding sites of protein kinases increasingly demandsthe quest to find selective inhibitors with little cross reactivity. Kinase kinases are a recently discovered group of Kinases found to be involved in several mitotic events. These proteins represent attractive targets for cancer therapy with several small molecule inhibitors undergoing different ph...

متن کامل

In silico investigation of lactoferrin protein characterizations for the prediction of anti-microbial properties

Lactoferrin (Lf) is an iron-binding multi-functional glycoprotein which has numerous physiological functions such as iron transportation, anti-microbial activity and immune response. In this study, different in silico approaches were exploited to investigate Lf protein properties in a number of mammalian species. Results showed that the iron-binding site, DNA and RNA-binding sites, signal pepti...

متن کامل

TargetATPsite: A template-free method for ATP-binding sites prediction with residue evolution image sparse representation and classifier ensemble

Understanding the interactions between proteins and ligands is critical for protein function annotations and drug discovery. We report a new sequence-based template-free predictor (TargetATPsite) to identify the Adenosine-5'-triphosphate (ATP) binding sites with machine-learning approaches. Two steps are implemented in TargetATPsite: binding residues and pockets predictions, respectively. To pr...

متن کامل

BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences

BindN (http://bioinformatics.ksu.edu/bindn/) takes an amino acid sequence as input and predicts potential DNA or RNA-binding residues with support vector machines (SVMs). Protein datasets with known DNA or RNA-binding residues were selected from the Protein Data Bank (PDB), and SVM models were constructed using data instances encoded with three sequence features, including the side chain pK(a) ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014